منابع مشابه
Stylistics: Corpus Approaches
Stylistics, which may be defined as the study of the language of literature, makes use of various tools of linguistic analysis. Corpus linguistics is opening up new vistas for the study of language, and there are interesting similarities in the approaches of stylistics and corpus linguistics. Stylistics is a field of empirical inquiry, in which the insights and techniques of linguistic theory a...
متن کاملThe Design of Text Signatures for Text Retrieval Systems The Design of Text Signatures for Text Retrieval Systems
Signature files are one technique for indexing documents for full-text retrieval systems. This paper discusses two methods for generating text signatures – the word fragmentation and the pseudo-random generation techniques. The paper evaluates the effectiveness and efficiency of generating text signatures using these techniques. It also determines the optimal set of characteristics that define ...
متن کاملHarvesting for Full-Text Retrieval
We propose an approach to Distributed Information Retrieval based on the periodic and incremental centralisation of full-text indices of widely dispersed and autonomously managed content sources. Inspired by the success of the Open Archive Initiative’s protocol for metadata harvesting, the approach occupies middle ground between: (i) the crawling of content, and (ii) the distribution of retriev...
متن کاملDocument retrieval and text retrieval
DR systems illustrate every variety of indexing language, request and document description, and search mechanism. Controlled languages (CLs) have been commonly used, across the range from only slightly restricted natural language (NL) to a carefully designed artificial language. With CLs professional indexing is required and professional searching is the norm. However automatic DR systems have ...
متن کاملLEXICALIZING COMPUTATIONAL STYLISTICS For Language Learner Feedback
Computational stylistics refers informally to a collection of tasks within computational linguistics that deal with the style—as opposed to the semantic content—of natural language. The most famous of these tasks is perhaps authorship attribution (Stamatatos et al., 2001), which uses statistical variations in word choice to select the most likely from a fixed set of potential authors. Though ap...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM SIGIR Forum
سال: 2006
ISSN: 0163-5840
DOI: 10.1145/1189702.1189710